Regular Expression Matching for Multi-script Databases
نویسنده
چکیده
Modern database systems mostly support representation and retrieval of data belonging to different scripts and different languages. But the database functions are mostly designed or optimized with respect to the Roman script and English. Most database querying languages include support for regular expression matching. However the matching units are designed for the Roman script, and do not satisfy the natural requirements of all other scripts. In this paper, we discuss the different scripts and languages in use in the world, and recommend the type of regular expression support that will suit the needs for all these scripts. We also discuss crosslingual match operators and matching with respect to linguistic units.
منابع مشابه
Unicode Canonical Decomposition for Hangeul Syllables in Regular Expression
Owing to the high expressiveness of regular expression, it is frequently used in searching and manipulation of text based data. Regular expression is highly applicable in processing Latin alphabet based text, but the same cannot be said for Hangeul∗, the writing system for Korean language. Although Hangeul possesses alphabetic features within the script, expressiveness of regular expression pat...
متن کاملA Musical Regular Expression Matching System for Relational Databases
We present a large scale model for a database of multi-track polyphonic songs searchable via a modified regular expression format and stored in a relational database. The regular expression format is of our own design and will likely be familiar to those with experience using common text search utilities. The relational database is a standard, off-theshelf SQL-based database management system. ...
متن کاملAssessing clinical reasoning skills using Script Concordance Test (SCT) and extended matching questions (EMQs): A pilot for urology trainees
Introduction: Clinical reasoning skill is the core of medicalcompetence. Commonly used assessment methods for medicalcompetence have limited ability to evaluate critical thinking andreasoning skills. Script Concordance Test (SCT) and ExtendedMatching Questions (EMQs) are the evolving tests which areconsidered to be valid and reliable tools for assessing clinicalreasoning and judgment. We perfor...
متن کاملPerformance Evaluation of Regular Expression Matching Engines Across Different Computer Architectures
Regular expressions are sequences of characters that define search patterns, commonly used in pattern matching with strings. Regular expression matching plays an important role in a variety of applications, such as bioinformatics, network inspection, etc. However, it is a challenging problem because pattern matching is a computationally intensive operation especially when dealing with large dat...
متن کاملA Multi-pattern Matching Algorithm Based on WM Algorithm
The research on the algorithms of pattern-matching is an important subject in the field of computer study. The algorithms can range from single-pattern matching and multipattern matching algorithms to extended characters matching and regular expression. Among the many multi-pattern matching algorithms, AC algorithm and WM algorithm would be the two most classical algorithms, but these two algor...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- IEEE Data Eng. Bull.
دوره 30 شماره
صفحات -
تاریخ انتشار 2007